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HOEFFDING-ANOVA DECOMPOSITIONS FOR SYMMETRIC 
STATISTICS OF EXCHANGEABLE OBSERVATIONS 

By Giovanni Peccati 

Universites de Paris VI 

Consider a (possibly infinite) exchangeable sequence X = {Xn ■ 1 < 
n < A*'}, where £ N U {oo}, with values in a Borel space [A, A), 
and note X„ = {Xi, . . . ,Xn). We say that X is Hoeffding decompos- 
able if, for each n, every square integrable, centered and symmetric 
statistic based on X^ can be written as an orthogonal sum of n U- 
statistics with degenerated and symmetric kernels of increasing order. 
The only two examples of Hoeffding decomposable sequences studied 
in the literature are i.i.d. random variables and extractions without 
replacement from a finite population. In the first part of the paper 
we establish a necessary and sufficient condition for an exchangeable 
sequence to be Hoeffding decomposable, that is, called weak inde- 
pendence. We show that not every exchangeable sequence is weakly 
independent, and, therefore, that not every exchangeable sequence 
is Hoeffding decomposable. In the second part we apply our results 
to a class of exchangeable and weakly independent random vectors 
-j^(q,c) _ jX^"''^') whose law is characterized by a positive 

and finite measure a(-) on A and by a real constant c. For instance, if 
c = 0, xL"''^' is a vector of i.i.d. random variables with law 
if A is finite, a(-) is integer valued and c= —1, X^"''^' represents the 
first n extractions without replacement from a finite population; if 
c > 0, Xi"''^^ consists of the first n instants of a generalized Polya urn 
sequence. For every choice of a(-) and c, the Hoeffding- ANOVA de- 
composition of a symmetric and square integrable statistic r(xi"''^-') 
is explicitly computed in terms of linear combinations of well chosen 
conditional expectations of T. Our formulae generalize and unify the 
classic results of Hoeffding [Ann. Math. Statist. 19 (1948) 293-325] 
for i.i.d. variables, Zhao and Chen [Acta Math. Appl. Sinica 6 (1990) 
263-272] and Bloznelis and Gotze [Ann. Statist. 29 (2001) 353-365 
and Ann. Probab. 30 (2002) 1238-1265] for finite population statistics. 
Applications are given to construct infinite "weak urn sequences" and 
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to characterize the covariance of symmetric statistics of generahzed 
urn sequences. 



1. Introduction. For any N G NU {+cxd}, consider a collection X = {Xn : 1 < 
n < N} of exchangeable random observations, whose components take val- 
ues in some Borel space {A, A) and are defined on a suitable probability 
space {Q,T,¥) [the reader is referred to Aldous (1983) for any unexplained 
notion concerning exchangeability]. For 1 < n < and q > 0, we write X„ 
and L''(X„), respectively, for the vector and for the class of 

real-valued functionals r(X„) such that ElTl"? < -l-oo. Roughly speaking, we 
say that the sequence X is Hoeffding decomposable (or Hoeffding-ANOVA 
decomposable) if, for every n, any centered and symmetric T G L^(X„) can 
be uniquely represented as an L^-orthogonal sum of n [/-statistics based 
on X„, say Ti,...,r„, such that each Tj has a (completely) degenerated 
symmetric kernel of order i. In particular, if X is Hoeffding decomposable, 
for each n the covariance between symmetric statistics based on X„ can 
be represented as a sum of covariances between degenerated [/-statistics of 
the same order. The problem of writing the explicit Hoeffding-ANOVA de- 
composition of a given random variable is usually adressed to characterize 
the covariance and the consequent asymptotic behavior of such symmet- 
ric functionals of the vector X„, as nondegenerated [/-statistics or jackknife 
estimators [see Koroljuk and Borovskich (1994) and Serfling (1980) for a sur- 
vey], as well as [/-processes [see, e.g., Arcones and Gine (1993)]. However, it 
has been completely solved in only two cases: when X is a sequence of i.i.d. 
random variables [as first proved in Hoeffding (1948), see, e.g., Hajek (1968), 
Efron and Stein (1981), Karlin and Rinott (1982), Takemura (1983), Vitale 
(1990), Bentkus, Gotze and van Zwet (1997) and the references therein], 
and when X is a collection of — 1 extractions without replacement from a 
finite population [see Zhao and Chen (1990) and Bloznelis and Gotze (2001, 
2002)], and in both instances, the degenerated [/-statistics Tj turn out to be 
linear combinations of well chosen conditional expectations of T. 

The aim of this paper is twofold. 

On the one hand, we shall establish a necessary and sufficient condition 
for a general exchangeable sequence to be Hoeffding decomposable. Our 
main result states, indeed, that X is Hoeffding decomposable if, and only 
if, X is composed of weakly independent random variables. The notion of 
weak independence is introduced here for the first time, and will be formally 
explored in Section 4. To capture the idea of weak independence, suppose 
X = {Xi,X2,X3), then, X is weakly independent if, and only if, the following 
implication holds: 
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where (p is an arbitrary symmetric kernel such that E[i?i)(X2)^] < +00. We 
win see that not every exchangeable sequence is weakly independent, and, 
therefore, that not every exchangeable sequence is Hoeffding decomposable. 

On the other hand, we will apply the above results to explicitly calculate, 
for every n, the Hoeffding-ANOVA decomposition of a general, symmetric 
T E L^(X„), when X is a generalized urn sequence (GUS), a notion that will 
be introduced in Section 5. As discussed below, the family of GUS contains 
exclusively exchangeable sequences; examples are i.i.d. random variables, 
extractions without replacement from a finite population, as well as gener- 
alized Pdlya urn schemes [such as the ones introduced in Ferguson (1973) 
and Blackwell and MacQueen (1973)]. Consequently, our formulae will ex- 
tend and unify the classic results about ANOVA decompositions for i.i.d. 
variables and finite population statistics, and will show that exchangeabil- 
ity is quite a natural framework for studying ANOVA-type decompositions 
of symmetric statistics. Note, however, that exchangeability is not a neces- 
sary condition for a random sequence to be Hoeffding decomposable, see, 
for example, Karlin and Rinott (1982), Priedrich (1989) and Alberink and 
Bentkus (1999), where the authors study the case of independent but not 
identically distributed random variables. In a companion paper [see Peccati 
(2002a), but also Peccati (2002b, 2003)], we apply our results concerning 
generalized Polya urns to obtain a "chaotic decomposition" of the space 
of square integrable functional of a Dirichlet-Perguson process [see, e.g., 
Ferguson (1973)] defined on a Polish space {A, A). 

The paper is organized as follows: in Section 2 we introduce some nota- 
tion; in Section 3 we define the notion of Hoeffding spaces and establish some 
useful results about exchangeable sequences and (symmetric) [/-statistics; 
Section 4 is devoted to the relations between Hoeffding decomposability and 
weak independence; in Section 5 we prove our main theorems about GUS, 
whereas Section 6 is devoted to further examples, refinements and applica- 
tions. 

Part of the results of this paper have been announced in Peccati (2003). 

2. Basic notation. Fix n>l. For any m G {0, 1, . . . , n}, we define 
Vn{m) := {k(„) = {ki,...,km):l<ki<--- <km<n} 
with the convention k(o) := and Vn{0) = {0}. We also set 

Ko(m)= U Vn{m). 

n>m 

Forn > m > 1, = (h,-- • ,^m) € Vooim) and k(„) = (/ci, ...,kn)€ Vooin), 
A k(„) stands for the class 

{li : li = kj for some j = 1, . . . ,n} 
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written as an element of Foo(r), where r := Card{l(m) Ak(„)}. Analogously, 
for any n, m > 0, k(„) \ l^^^) will indicate the set {kj : kj ^ k \/ i = 1, . . . , m} 
written as an element of the class Voo{n — r). Again, given k(„) G Kx)(?^) and a 
vector h(^) = {hi, . . . , hm), by h(^) C k(„) we will mean that h(^) G Kx)(m), 
and that for every i £ {1, . . . , m}, there exists j G {1, . . . , n} such that kj = hi. 

As in the Introduction, we now fix G N U {+00} and consider an ex- 
changeable sequence X = {Xn : 1 < n < N} composed of random variables 
with values in the Borel space {A, A). By exchangeability we mean that the 
law of X is invariant under finite permutations of the index set {n : 1 < n < 
A''}. More to the point, when N < +00, X will always satisfy by convention 
the following: 

Assumption A. When A^ is finite, the vector X = (Ai, . . . , A7V-1) is 
composed of the first A^ — 1 elements of a finite exchangeable sequence 

(Ai, . . . , A27V_2). 

In the terminology of Aldous (1983), Assumption A implies that (Ai, . . . ,AAr_i) 
is a 2(A — l)-extendible exchangeable sequence. [We recall that, according 
to Aldous (1983), for 2 < M < +00, an exchangeable vector (Yi, . . . , Ym) is 
said to be (M + fc)-extendible (fc > 1) if there exists an exchangeable vector 
{Zi, . . . , ZM+k) such that 

{Yi,...,YMy={Zi,...,ZM). 

Of course, not every exchangeable vector is extendible.] This point will play 
an important role in the next section. Recall that if A = +00, and, therefore, 
X is an infinite exchangeable sequence, de Finetti's theorem [see Aldous 
(1983)] implies that X is a mixture of i.i.d. sequences. 
For any 1 < n < A, we define 

= (Ai, . . . ,Xn), 
Xo = 

and, for any n > and every j(„) G Vooin), we write 

^j(n) = (^Jl' ■ • ■ >^jn)- 

Now fix 1 < n < A^, and consider a symmetric and measurable function T 
on such that T(X„) G L^(X„). Then, exchangeability implies that for 
every <r <m <n, there exists a measurable function 

with the following properties: (a) for every G Vooin) and ^ ^^00(^71) 
satisfying Card{i(„) A = r, one has 

(1) E[r(Xj^„,)|Xi^^J = [r](^)„(Xi(„)Aj(„,,Xi^^^\j^„^) a.s.-P; 
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(b) for any fixed (ai, . . . , Ur) G , the apphcation 

{ar+i,...,am) ^ [T]'^jn{ai,...,ar,ar+i,...,am) 
is symmetric; (c) for any fixed (a^+i, . . . , am) € A^'^ , the apphcation 
{ai,...,ar) ^ [T]l^}^{ai,...,ar,ar+i,...,a„ 



is symmetric. We will denote by [T]^ ^ the canonical symmetrization of 
[T]^n^m, that is, 



?n! 
m 



■7r(m) V 



XI [^]n'^m(ajM'^(l,.-,m)\j(,)), 

where aj^^j = (oji, . . . ,aj^), for every r <m and every j(^) S V^(?'), and vr 
runs over all permutations of the set (1, . . . , m). 

3. HoefFding spaces associated to exchangeable sequences. 

3.1. Hoejfding spaces. Let the previous notation and assumptions prevail 
throughout this section. For a certain 1 < n < A^, we introduce the following 
notation. Set Uo = ^ii and, for i = 1, . . . , n, 

t^^(Xn) = v.s. {r(Xj^^,):r(Xj(j eL2(x„),j(,) eK(i)}'''^''"\ 

where v.s.{B} indicates the vector space generated by B, and eventually 
Ho = Uo, 

i/i(X„) = UiiXn) n C/^_i(X„)^, i = l,...,n, 

where i7i_i(X„)^ denotes, for every i, the orthogonal of C/j_i(X„) in L^(X„) 
[the reader is referred, e.g., to Dudley (1989) for any unexplained notion 
concerning Hilbert spaces]. We also set L^(X„) to be the subspace of L^(X„) 
composed of symmetric functionals of the vector X„ and eventually, for 
1=1,. ..,n, 

SUo = SHo = sft, 

) 

T:T= Yl <A(Xj(^)),</'(X.)Gi2(x,) 

j(i)GK,(i) J 
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where, in the last formula, the orthogonal is taken in L^(X„,). 

We define {i/j(X„) : i = 1, . . . , n} and {5i/j(X„) : i = f, n} to be re- 
spectively the collection of Hoeffding spaces and symmetric Hoeffding spaces 
associated to X„. It is immediate that the class C/j(X.„) represents, for a 
fixed i < n, the span of those functionals of X„ that depend at most on 
i components of the vector X„, and that the -ffj(X„)'s are obtained as a 
Gram-Schmidt orthogonalization [see Dudley (1989)] of the increasing se- 
quence {Ui(Kn)}- On the other hand, SUi{X.n) is the subspace of C/j(X„) 
generated by ^/-statistics, based on X„, with symmetric and square inte- 
grable kernels of order i. 

Given T G L^(X„), for every i = 0, . . . ,n, we will use the symbols 

TT[T,H.i\{Xn) and 7r[r, 5F,](X„) 
to indicate the projection of T on i7j(X„) and SHi{X.n). Of course, for every 

n 

r = E(r) + ^^[r,//,](x„) 

i=l 

and for every T G L^(X„), 

n 

r = E(r) + ^7r[r,5i/i](x„). 

i=l 

The rest of the paper is essentially devoted to the characterization of the 
operators 

7r[-,5//i](X„) :L2(X„) ^ SHi{y.n):T ^ TT[T,SHi]{y.n) 

for X belonging to some special class of exchangeable sequences. In partic- 
ular, we will be interested in sequences satisfying the following: 

Definition 1. The exchangeable sequence X is said to be Hoeffding 
decomposable if, for every 1 < n < and every 1 < i < n, the following 
double implication holds: T G SHi{X.n) if, and only if, there exists 

such that (/"^^(Xi) G L^(Xi), 

(3) E[0$^)(Xi)|Xi_i]=O, P-a.s. 

and 
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Of course, the crucial point in the above definition is given by (3). When 
(f)"^ is such that <?!>^^(Xj) € L^(Xj) and satisfies (3), we write 



It is well known that i.i.d. sequences are Hoeffding decomposable. As al- 
ready pointed out, this feature has been the key tool to study the asymptotic 
behavior of symmetric ?7-statistics, via the characterization of their covari- 
ance structure [see, e.g., Serfling (1980) and Vitale (1990)]. We will see in 
the next section that another archetypal class of Hoeffding decomposable 
sequences is given by extractions without replacement from finite popula- 
tions. 

3.2. Hoeffding decompositions for finite population statistics. In this sec- 
tion we shall shortly recall some of the findings of Zhao and Chen (1990) that 
will be useful in the following sections. Note that the theory of Hoeffding 
decompositions for finite population statistics has been further developed in 
the works of Bloznelis and Gotze (2001, 2002) that have inspired our pre- 
sentation. 

Fix M > 1. We note z = (zi, . . . , za/), a nonordered collection of M ele- 
ments of A, and we identify z with the measure on {A, A) given by 



We note Zm{A), the set of all such z. To each z G Zm{A), we associate 
the random vector 



where vr* indicates a random permutation, uniformly distributed over all 
permutations of (1, ... , M). In other words, Y^^ has the law of a vector of M 
extractions without replacement from a finite population whose composition 
is given by the measure fiz. The following result, that is essentially due to 
Zhao and Chen (1990), characterizes the class of symmetric Hoeffding spaces 
associated to = (y/*", . • • , 1"^"), when m < M. Of course, has the 
law of the first m extractions without replacement from z. 

Proposition 1. LetT £ L^(Y^f ), where z eZM{A) and m < M. Then, 
there exists a unique class of functions 



(A? GHi(X). 



l^.{C) = —Y,lc{z,), C^A. 



i=l 



Y^'^ = (yl^^...,y,t) = (^,.(l),... 





■.A' 



m 



i = 1, . . . ,m, 



Ebr,, (Yr^)lYf^ ] = 
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for every G Vm{i) and every G Vm{i - 1), and 

(4) n[T,SH,]{Y^^^)= 5??,.(Yj;p. 

(i) 

Moreover, gj- = when i > M — m, and also, 



/m\ /M—m\ 

(5) nAT,SH,]{Y^^f] = h ^M-m>^)n9?,,S^^,^^f\■ 



Formula (4) implies that Ylf^ is Hoeffding decomposable. Proposition 1 
will be used in the proof of the main result of the following section. The 
main reason of its usefulness is nested in the following basic result, whose 
proof can be found, for example, in Aldous (1983). 

Proposition 2. Under the previous notation, let Xm = (-'^i, . . . ,Xm), 
M<+oo, he a finite exchangeable sequence with values in {A, A). Then, 
conditioned on ^Xj,/, ihe law ofKM coincides a.s. with that ofY^^M ^ that 
is, a.s.-F, for every C € A'^^ , 

TV 

where vr runs over all permutation of (1, . . . ,M). 

3.3. Representation of U -statistics for exchangeable observations. To avoid 
trivialities, from now on we will systematically work under the following: 

Assumption B. For every l<i<n<N, i/i(X„) / {0} and S'i?i(X„) / {0}. 

Assumption B excludes, for instance, the case X„ = Xi for every n > 
1. Note that, under Assumption B, for each 1 < i < n (as usual, given a 
collection {A, Aj : j = 0,1, . . .} of Hilbert spaces, we write A = ^Aj to mean 
that Aj C A for every j , Aj _L Ai for i j and that every x ^ A admits 
the (unique) representation x = ^'K[x,Aj\, where vr stands again for the 
projection operator), 

n 

U.iXn) = 0i/„(X„) C L2(X„) = CZ„(X„) = HaiXn), 

a<i a=0 

n 

SU,{Xn) =^SHa(Xn) C L2(X„) = 5f/„(X„) =^SHa(Xn)- 

a<i a=0 

We shall now show that the elements of SUi(X.n) have a unique represen- 
tation. Our key tool will be the following result. 
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Lemma 3. Let X = {X„ : 1 < n < N} be an exchangeable sequence, sat- 
isfying Assumption A in the case of a finite N , as well as Assumption B. 
Then, there exist constants k(N,n,i) G (0, +oo), 1 < i < n < A^, depend- 
ing uniquely on N , n and i (and not on the law of Ji.) satisfying for ev- 
ery i = 1, . . . ,n, and every real valued </>(•), defined on A^ and such that 

\ 2i 



E 



(X 



> k{N,n,i)E[(j){'Xi 



Proof. We start with the case N = +oo. In this case, de Finetti's the- 
orem [see once again Aldous (1983), Section 7] yields the existence of a 
random probabihty measure D{-;uj) such that, conditioned to D, X is a 
sequence of i.i.d. random variables with common law equal to D. It follows 
that [noting (^)^ = (^)l(a>fe)]) due to symmetry and exchangeability, 

- / \ 2n 

E ^ 



^ E E 

j{«)6y„(i)'-=0 
i 

^ E E 

j(i)eV„(i)r-=0 



n — I 
i — r 



E[,/.(Xi)0(X„X,+i,...,X 



2i-r 



I \ n — i 



r I \ I — r 



E 



D^^{dai, . . . , dor 



'■2i-r) 



I D^'-^dai+i,...,da, 

X <^(«1, • • • ,arj fli+l, • • • J 0'2i-r 



> 



j(i)GV„(i) 



n 



Now we deal with a finite N . We recall that, in our setting, X is in this case 
of the form {Xi, . . . ,Xj\j-i), with X2Ar_2 = {Xi^ . . . ^X2N-2) indicating an 
exchangeable vector of 2(A^ — 1) random variables. Then, we use extensively 
the content and the notation of Propositions 1 and 2 to obtain, due again 
to symmetry and exchangeability, 

\ 2n 



E 



E 



10 



(6) 



E 



E 



L^j{oGV„(j) / 
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2 



E[0(Xi)|/iX2jv-2] 



+ E E 

'==lj(fc)GV„(fc) 



i — k 



and, therefore 

E 



2i 



fx, 



j(i)GV„(i) 

2 



(7) 



:E 



+ E 

fc=i 



/n—k\ 2 i2N —2—n\ /n\ 

-k) I fc AfciTD^r (fc) 



To be clear, the calculations contained in (6) and (7) are performed as 
follows. First, write the Hoeffding decomposition of 0(Xj^.j), under the con- 
ditioned probability E[-|/^X2]v-2] ™d for every G Vn{i)- Then, by using 
the relation 



(k) 



(^j(fe))l/^X2]V-2'^j(fc-l)] -0' 



-a.s. 



for every j[k-i) € Vn{k — 1) [that can be verified directly, by inspecting the 
proof of the main results of Bloznelis and Gotze (2001) or by using Corol- 
lary 9; i.e., not circular reasoning, as a matter of fact, to prove Proposition 8 
and Corollary 9, we do not need Lemma 3], observe that 



E 



n- k\ jk) 
i — k 



is the projection of X)j(j)eV„(i) <^(-^j(i) ) symmetric Hoeffding space 

associated to X„ under the measure P[-|/xx2jv-2]- Finally, use Proposition 1. 
Now write 



A;(A^, n, i) = min-! 



(■r:)'r-/-")C) , : 

,s = 1 I 
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to obtain, thanks to the Jensen inequaUty, 

E 

^j(,)GVn(i) 



> k{N,n,i)E 



E[0(Xi)|/iX2JV-2]' 



« / ■ \ 2 



k=l 



+ 1:1;) 



(Xfc)^|/iX2JV-2] 



> A;(A^,n,i)E 



+ Ee 



fc=i 



/^X2 



= /c(iV,n,z)E[E[,/.(Xi)Vx2^-2]], 
which yields the desired result. □ 

Remark. An inspection of the proof of Lemma 3 shows the relevance of 
the assumption: for a finite A^, X = {Xi, . . . , Xj^-i) is a 2(A^ — l)-extendible 
sequence. Suppose indeed that {Xi, . . . , A7V-1) are the first — 1 instants 
of a sequence Xm = (A^i, • • • , Xm), with N <M < 2N — 2. Then, according 
to Proposition 1, 



E E 

*: = lj(fe)GV^n{fc) 



i-k ]^</',Mx^/^JW^ 



min{i,M~i} 



n — k\ (fc) 



(Xj 



a.s.-P(-|;UXj, 



and, also, 



(8) 



E 



min{i.J\f— «} / 

E E 

fc=i Vj(,.)ev„{fc) 



i — k 



fx 



■j{fc)^ 



min{j, M—n} m—k\'^/M 



E 

fc=i 



k J^kJTw.r (k) n2| 1 



It is easily seen that, when i>M — i>M — n, relation (8) does not allow 
to conclude the proof of Lemma 3. 
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Lemma 3 has important consequences which are stated in the next two 
corollaries. 

Corollary 4. For 1 < i < n < N , suppose the applications (p o-iT-d <p' , 
both from to 3?, are such that (^(Xj), i;^>'(Xj) E L^(Xj). Then, 

implies 

<^(Xi) = 0'(Xi), F-a.s. 

Corollary 4 says that elements of SUi(X.n) admit an essentially unique 
representation as [/-statistics with symmetric kernel of order i. The next re- 
sult states that 5C/j(X„) contains exclusively random variables of this kind. 



Corollary 5. For I <i <n< N , 

SUi{y.n) = \T:T= J2 <^(Xj(J,</.(X,)eL2(X,)|. 

Proof. For fixed i and n as in the statement, just observe that if the 
family {T^'^ : / > 1}, defined as 

is a Cauchy sequence in Lg(X„), then Lemma 3 implies that (^^'^(Xj) is also 
Cauchy in L2(X,). □ 

4. Hoeffding decomposability and weak independence. For the rest of 
the section X will be a possibly infinite exchangeable sequence satisfying 
both Assumptions A and B. 

Definition 2. We say that the sequence X is composed of weakly in- 
dependent random variables (or that the sequence X is weakly independent) 
if for every 1 < n < and every T G L^(X„), 

[r];^„-il(X„_i) = 0, a.s.-P, 

implies 

[T]i^Li(Xn-i) = 0, a.s.-P, 

for every < r < n — 1 such that 2n — r<N, where the functions [T]^} and 
[T][] have been introduced, respectively, in (1) and (2). 
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Of course, independence implies weak independence. Another example 
of weak independence is given by sampling without replacement and, in 
general, by the class of GUS that we will discuss in the next section. However, 
not every exchangeable sequence is weakly independent. 



Example (A class of exchangeable sequences that are not weakly inde- 
pendent). Consider an infinite sequence 

X = {X„:n>l} 

with values in {0, 1}, whose law is determined by the following relation, valid 
for every n > 1 and every (ei, . . . , e^) G {0, 1}": 

r=i 

where e is a fixed constant such that < e < 1. 

This is equivalent to saying that, conditioned on the realization of a real 
valued random variable Y such that 



P(y G C) = e-^ [ dy 
J(o.e)nc 



(o,e)nc 

the sequence X is composed of independent Bernoulli trials with common 
parameter equal to Y. In this case, a necessary condition for X to be weakly 
independent is that for any symmetric </> on {0, l}'^ such that 

(9) E(0(Xi,X2)|X2) = O 
must also hold 

(10) E{<P{Xi,X2)\Xs)=0. 

We shall construct a symmetric (p that respects (9) but not (10). Define, 
indeed, 



and also 



(^(0,0) 

so that 



(1,0) = (/.(0,1) = 1 



Jq x{l — x)dx ^ 3 
Jq dx 2e ' 

llx{\-x)dx _ - (3/2)£ 



llil-xfdx 3-3e + e2' 

E(,^(Xi,X2)|X2 = 0) =e((/>(a:i,X2)|X2 = 1) = 0, 
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and also 

miX.,X,)\X, = 0) = i ^3 _ 3^ ^>^^^^^^,^ < 0. 

since e € (0, 1). 

It is interesting to note that by taking e = 1, one would obtain a weakly 
independent sequence. As a matter of fact, X is in this case a Polya urn 
sequence with parameters (1,1) (see the discussion below). 

The following result establishes a necessary and sufficient condition for 
Hoeffding decomposability. 

Theorem 6. The exchangeable sequence X is Hoeffding decomposable 
if, and only if, it is weakly independent. 

Proof. To simplify, we will systematically consider r.v.'s T such that 
E(r) = 0. Now suppose that the sequence X is weakly independent, and take 
r(X2) € Lg(X2). According to Corollary 5, there exists a function : A i— > 

such that E[((/>^^^(Xi))2] < +oo, and also 

Tl[T,SHi]{y.2)=4\Xl)+(t>T\X2). 

(11) tt[t,sh2]{-K2) = r(X2) - - 

Plainly, (/.^"^ € H2(X): as a matter of fact, for every bounded h on A and 
thanks to exchangeability and symmetry, 

E[0g)(X2)/i(Xi)] = iE[#(X2)(/l(Xi) + /l(X2))] 
= 0. 

Now take n > 2. To show that if G € S'i?2(X„), then there exists (f)Q G 
H2(X) such that 

(12) G= <^G(Xj(.)), 

j{2)GV'„(2) 

it is sufficient to show that representation (12) holds for random variables 
of the type 

G = 7r[F,5i/2](X„), 

where F is centered and such that F € 5C/2(X„). Thanks again to Corol- 
lary 5, we know that there exists a symmetric and square integrable kernel T 
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such that 

j(2)ey„(2) 

and also, with the notation introduced in (11), 

n 

7r[F, 5Fi](X„) = (n - 1) ^^^(X,), 

i=l 
j(2)6V„(2) 

As a matter of fact, 

F= y: in^i,,,) - 4\w - 

j(2)6F„(2) 

j(2)GV'n(2) 

n 

= E #(Xj^,,) + (n-l)E#(^^)- 

j(2)GKi(2) 4=1 

Moreover, for every /i such that E(/i(Xi)^) < +cx3, 

e( Y 4'^(Xj(,,)EMxo)=o, 

^j(2)eKj(2) j=i / 

since we have assumed that X is weakly independent. Now we use a recur- 
rence argument. Suppose, indeed, that there exists k>l with the following 
property: for every k < n < N , for i = l,. . . ,k — l, F £ SHi(X.n) implies that 
there exists (p^p G Hj(X) such that 

(13) ^= E 

and observe that we have verified such a claim for k = 1,2,3. Given k, we 
shall verify that for every n>k, a, random variable of the type 

G = 7T[F,SHk]{Xn) 

for a generic F G SUk(^n) has the representation (13) for i = k and (f)^p G 
Hj(X). To see this, start with n = k, and take a symmetric and square 
integrable kernel T such that E(r(X„)) = 0. Then, there exist (pj) G Hj(X), 
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i = 1, . . . , A; — 1, such that 



7:[T,SH,]{X,)= ^'(^k 



i = 1, . . . ,k — 1, 



j(z)6Vfe(i) 



(14) n[T,SHk]{X,)=T{Xk)-Y^ ^ ^^X^^, 



Since for every bounded and symmetric function h on A with the form 

k-l 

h{ai,...,ak-.i) = J2Il lc,(a^(i)), 

TT j = l 

where Ci, . . . , C^-i € A and vr runs over ah permutations of (1, . . . , A: — 1), 
we have 



= IE 



j(fc-i)eVfc(fc-i) 



(k) 

due to exchangeabihty and to the symmetry of (prp , we obtain immediately 
(p^^ G Hfc(X). Now, for n> k, take F G SUk(X.n) with the form 

j(fe)GKi(fc) 

where T is a centered, square integrable and symmetric kernel. Then, by 
using the same notation as in (14), 



k-l 



}(k)&Vn{k) *=i j(i)ev„{j) 



n — i\ (i) 



=i j(i)ev'n(i) ■ 

and, moreover, for every h on A''"^ such that /i(Xfc„i) € L^(Xfc„i), 



E 



E E MX 



-j(fe-i). 



■j(fc)Gyn(fc) 



j(fe_l)GV„(fc-l) 



0, 
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since for every j(fe„i) e Vn{k - 1), 



j(fc)ey„(fc) 
fc-1 

= E E ^(Card(j(fc)Aj(fc„i))=r)[</'T ]fc,fe-l(^j{fc) Aj(fc_i) ) ) 
'^=Oj(fc)Gy„(fc) 

fc-1 

= E E E ^(j(fe)Aj(fc_l)=j(^))[<;^T ]fc,fc-l(^j{r)'^j(fc-l)\j(r)) 

^ = Oj(r)Cj(fe_l) j(fe)GKi(fc) 

and, therefore, 

E IE[#(Xj^,,)|Xj(,_J 

j(fe)el^n(fc) 



X" V- (n-k + l\ r,{fc)n(r) ^ 

I jt-r / ^"^T Jfc,fc-n^j{r)'^j{fc-l)\j(r 
'•=Oj{r)Cj(fc_l) ^ ^ * 



E( ) ( r ) ['^T ]fc,fe-i(^j(fc-i). 





(k) 

thanks to the assumption of weak independence and to the fact that 0^ G 
Hfc(X). 

On the other hand, it is clear that if X is weakly independent and, for 
l<i<n<A^, F has the representation (13) for 0^*^ G Hj(X), then for any 
e Vn{i - 1), 



r=0 



E[i^|Xj,_J=E( 7-;) ( r ) ['^^1m-i(^J(.-J=0' 



and, therefore, F G 5ffj(X.„). 

Thus, we have shown that weak independence implies Hoeffding decom- 
posability. To deal with the opposite implication, suppose for the moment 
that = +00, and that X is Hoeffding decomposable in the sense of Defi- 
nition 1. For a given A; > 1, consider a certain r(Xfc) G Lg(Xfc) such that 

[Tt,k-i{^k-i) = 0, P-a.s. 

Then, 

j{fc)eVfe+i(fc) 
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that yields, due to exchangeability and symmetry, 
= E[F(Xfc+i)|Xfc_i] 

j(fe)eVfe+i(fe) 

k-l 

= E E E l{j(fe)A{l,...,fc-l)=jM) 

r=fc-2j(,,)eVfc_i(r)j(fc)gVfc+i(fc) 



E [^]i^fc-i(^j{*-2)'^ 

j(fe_2)eVfc_i(fc-2) 



■j{*-2)'-^(lv,fc-l)\j{*-2)) 



(A: -2)!' 

Now we use again a recurrence argument. Suppose, indeed, that the 
Hoeffding decomposability of X implies the following relation for every 
r(Xfc)GL2(Xfc): 

for a certain 2 < j < k — 1, and every 2 <l < j. Then, if T is such that 
[r]iffcJ^i = 0, we must have 

j(fe)eVfe+j(fe) 

that implies, again by exchangeability and symmetry, 
= E[F(Xfc+j)|Xfc_i] 

= ^ E[r(Xj^,,)|Xfe_i] 

j(fe)GVfe+j(fc) 
k-l 

= E E E l(j(fc)A(l,...,fc-l) = j(,)) 

r=fc-j-lj(^)eVfe_i{r)j(fc)eVfe+j(fc) 

^ [^]B-l(^j(r)'^{l,.-,fc-l)\jM) 



E r ) [^]fe,fc-l(^jM'^{l,.-,fc-l)\jM' 

k-l \~{k~j~l) 
f^_j_ljl^\k,k~l [-^k-l) 
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and, therefore, the desired result. To deal with the case of a finite N, just 
repeat the same argument for j such that A; + j<A^ — 1. □ 

One immediate consequence of Theorem 6 is the following: 

Corollary 7. Let the exchangeable sequence X be weakly independent. 
Then, for every 1 < n < N , every T(X„) S L^(X.„) and every i = 1, . . . ,n, 

7T[T,SHi]{Xn)=7T[T,H,]{Xn). 

Starting from the next section we analyze the specific case of GUS. 

5. The case of GUS. In this section we shall investigate the case of 
GUS, which represent a fundamental example of Hoeffding decomposable 
sequences. We will consider uniquely the case: {A, A) is a Polish space en- 
dowed with its Borel cr-field. More precisely, for € N U {oo}, and writ- 
ing J^{A) for the class of finite and positive measures on A, we say that 
a sequence 

is a GUS of parameters a G M{A) and c G 5R, if a{A) + c{N - 1) > and if, 
for every k and every j^;,) e VAr„i(A:), 

(15) mt' ^ . ■ . .4"' ^ - n ""^iii^g?;;""' . 

where :=c6x{-), with Sx{-) the Dirac measure concentrated in x. Note 
that (15) is equivalent to the following relation: for every C £ A and for 
every n < N, 

(16) P(4-^) G . . . , X._,) = "^^l+^Y'^lf^ 

a[A) + c{n — 1) 

Equations (15) and (16) imply that, for every choice of a and c s.t. a{A) + 
c{N — 1) > 0, the sequence X^'^''^^ is exchangeable. One can think of A as 
an urn whose composition is determined by the measure a(-) (thus, A could 
contain a "continuum" of balls), whereas X^'^''^^ represents a sequence of 
extractions from A according to the following procedure: at each step, one 
ball is extracted, and (1 + c) balls of the same color are placed in A before 
the subsequent extraction (one should substitute "placed in" with "elimi- 
nated from" when c < —1). Note that the assumption a{A) + c{N — 1) > 
ensures that the urn is not exhausted before the (A^ — l)st step; more to this 
point: when c = 0, X^"''^) is a sequence of i.i.d. variables with common law 
a{-)/a{A); if A = {ai, . . . , 05}, a is the counting measure and c = —1, then 
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we must have a{A) = S > N — 1 and X*^"'"^) has the law of the first — 1 
extractions without replacement from the finite population {ai, . . . , as} [this 
is the case studied in Zhao and Chen (1990) and Bloznelis and Gotze (2001, 
2002)]; when c > and N is infinite, X^"''^) is a generalized Polya urn se- 
quence whose directing measure [in the terminology of Aldous (1983)] is a 
Dirichlet-Ferguson process on (yl. A) with parameter a{-)/c [the reader is re- 
ferred to Ferguson (1973), Blackwell and MacQueen (1973), Blackwell (1973) 
and Ferguson (1974) for definitions, proofs of the above claims and discus- 
sions of the relevance of such objects in Bayesian nonparametric statistics; 
see also Pitman (1996) for a rich survey of some recent developments of Polya 
urn processes]. Note also that, in all cases, the law of X^"'*^) is characterized 
by the following two facts: (i) for every j < N, F{Xj"'^^ G dx) = a{dx)/a{A), 
(ii) for every j < N, the law of 

{X^Z^:l<n<N-j} 
under the probability measure 

p(.|x;"''=)=xi,...,xf'^)=x,) 

is that of a GUS of length N — 1 — j and parameters a(-) + J2k=i,...,j (') 
and c. 

To be sure that Assumption B is satisfied and that we work with 2(A^ — l)-exten- 
dible sequences, we will systematically assume that a{A) + c2(A^ — 1) > 0. 
For instance, in the case of extractions without replacement from a finite 
set of cardinality a{A) S N, this condition is necessary and sufficient both 
to have 2(iV — l)-extendibility and to satisfy Assumption B. [More precisely, 
consider the case of extraction without replacement from a finite set A, and 
suppose that Card(^) = S > 0, and that S/2 < N — 1 < +oo. In this case, 
it is easy to see that every symmetric statistic of {x['^'^\ . . . ,X^j^fl) is con- 
tained in the space SUs~n~iP^^n-i) ' i-^-' the projection of any symmetric 
statistic on 

N~l 
k=S-N 

must equal zero; see Bloznelis and Gotze (2001), Proposition 1, for a com- 
plete discussion of this point.] 

One nice feature of GUS is that they are weakly independent, and, there- 
fore, thanks to Theorem 6, Hoeffding-decomposable, as shown by the fol- 
lowing: 

Proposition 8. Let X*^"'^) be a finite GUS satisfying the assumptions 
of this section and, for a fixed I < n < N , consider a symmetric T(X^"''^^) G 
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L^(xi°''^^). Then, for every m = 1, . . . ,n and for every G VN-i{n) and 
every i(^) G VN-i{m), the following equality holds with probability one: 

(17) 

X . . E . mSl(x]:f ), 

i(7n)/^j(n)Cj(q)Ci(m) 

«;/iere [r]l°J, = E(r), 

/?g,m,r(a(^),c) 

1, q = m, 

{a{A) + c{m — 1)) X • • • X {a{A) + cq), r <q <m — 1 

and r = = Card(i(m) /\ j(n))i '^'^'^ oZ/ conventions are as before. 

Proof. To prove (17), consider a vector G VN-i{n), as well as an 
index i ^ it is easily verified that 

. .(0), ia,c). _ nc (1) (»,c). a{A) (p) 

that gives (17) for m = l. To show the general case we use once again a recur- 
rence argument. Assume, indeed, that the result is proved for m = 1, . . . ,k — 
1: we recall that for every G VAr-i(A:), for any fixed x^. = (xi, . . . , Xr) G , 
under the probability measure 

where r = rfit^) A hn)) is defined as in the statement, the vector X^"''^? . 

is a finite GUS of length n — r and parameters a(-) + J2i=i r ^xi (') ^.nd c. 
Now fix G VN-i{k) such that r > 0. The recurrence assumption, along 
with the obvious relation (i^^,) \j[n))^{i(n)\ i(fc)) = implies 

,t^o nt7(«(^)+c(n+z-i))'^'''' 
X E mi74K,x£f). 

j(g)C:i(fe)\j(„) 
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But 

Pq,k-rfi{aiA) +cr, c) 

^ri, q = k-r, 

\ {a{A) + c{k-l)) X ■■■ X {a{A) +c{r + q)), 0<q<k-r -I 

and the change of variables p = q + r yields immediately (17). We are left with 
the case i^^) A = 0: to see that the statement is still valid, fix Xj^ G A and 

i(fc-i) 



Xj = {xi-^ , . . . , J ) G ^ and write, due to the recurrence assumption. 



[^]n,l(^«fe'^i(fc-i)) 

= E[r(xS°''^))|x(°''=)=Xi,,xf"'';). =x= 

^ ^ J{n) «fe 'fe' l{k)Vk Hfc-l)J 

^ nLi(^-^ + i) — ^ (^^^^ + ^) 
,t^o nt~i'(«(^)+c(^+0) ' 

^ X] [^]i'g+l(^j(g)'^«fc) 

j(g)Ci(fe)\jfc 

Ut=Ha{A) + c{n + l)y 
X ••• X (a(A) +c(g + l)) 

J(q)Cl(fc)Vfe 

where xj^^^ stands for (xj^, . . . ,Xj^), giving the desired conclusion. □ 

Actually, Proposition 8 yields much more than weak independence. As a 
matter of fact, we have the following: 

Corollary 9. Let X("'^) be a GUS as in Proposition 8, and fix 1 <n < 
N and m<n: if a symmetric T on A^ is such that T(xi"''^^) G L^(xi"''^^) 
and 

(18) [r]("^(X{;^'^)) = 0, F-a.s., 

then 

mSL(X(;^''=))=0, F-a.s. 

for every r <m and such that n -\- m — r < N . In particular, if r(xi'^''^^) G 
L2(x1"'^^) and T satisfies (18), then T(xj"jj')) G [/„(xS^'''V for every 
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j{n) £ Vn-i{i^) <ind m < M < N, where UmO^^M''^^) denotes the direct sum 
of the first m Hoejjding spaces associated to 'K^^/^\ and the orthogonal is 
taken in L^(X^"''^^). This implies that X^"''^-' is weakly independent. 



We have also the following generalization of the calculations contained, 
for example, in Bloznelis and Gotze [(2001), formula (2.5)]. 

Corollary 10. Let X^"''^) be a GUS. Take T and V square integrable, 
symmetric on and satisfying the hypotheses of Corollary 9 for m = 
n — 1 [i.e., T, y G H„,(X("''^))].- then, for every G VN-iin) such that 

Card(i(„) A j(„,)) =r, 

(19) 

n—r 7 _i_ 1 

=°""' n „(;);:(;;,_i) ^F(x!r')v(x(°.^')i. 

We now want to calculate the explicit form of the Hoeffding-ANOVA 
decomposition for urn sequences. 

5.1. Hoeffding decompositions for GUS {statements). Now consider a 
sequence X^"''^) that is a GUS in the sense of the previous section, and 
fix 1 < M < A'^. Most of the subsequent results are related to the following 
sequence of real constants associated to the law of X^"''^) : 

(20) Hnmrp)-^im-r), 11-^'' HA) + c{r + p + s - l)\ 

where 1 <m <n < M, < r < m, < p < m — r, a{A) + c(n + m — r) > 0, 
(a)(b) := a\/bl for a > 6 and ns=i = 1 = O*' by convention, and, for 1 < q < 
m<n< M, 

(21) ^M{q,n,m):=Y,(l) (^^Zr) H'n,m,r,q-r) 



r=0 



with (^)^ := (^)l(a>6)- We are now in a position to state the main result of 
the section. 

Theorem 11. Under the previous notation and assumptions, fix a ^ 
^A{A) such that ^nfiQ, n,q) for every n = 1, . . . , M and every 1 <q <n. 
Write also, for any k>l. 



(22) 



^i^:={^M{k,k,k))-' 
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with the notation introduced in (21). For every s = 1, . . . ,M —1, the following 
equality holds a.s.-F for any T € L^(X^''^'*) with E(T) = 0/ 



a=l 



(23) 



j(a)6VA/(a) 



j{a)eVA/(s) La=l 
-1 



EC':\ E imUK: 

j(a)Cj(s) 



(".c)^ 
) ^ 



where O^j^f^^ ■~ ^M'^\^-a) ^'^^ coefficients 6^]^/""^ are recursively de- 



fined by the set of conditions {Sjv/(^)) k = 1, . . . ,M — 1} given by 
(24) SM{k) :-- 



Ak,k) _ (k) 

k i 

J2J2^M^'^Miq,k,j)=0, q=l,...,k-l, 



and, consequently, 

vr[r,s//M](xir^) = E E 



M 



a=lj{a)GVA/(a) 



where 9 
1. 



{M,a) 



M 



■ ^Ir^ /or a = 1, . . . , M - 1 and <f = *m (M, M, M)-^ 



Note how the above assumptions, concerning the constants ^'a/(-, •, •), are 
immaterial in the case c > 0. It is also clear that Theorem 11 can be applied 
to noncentered symmetric statistics by considering T' : = T — E(T). 

The statement of Theorem 11 can be further refined by means of Theo- 
rem 6 and Corollaries 4 and 10. Indeed, the symmetric functionals 



(25) 



Ecf E mSa(x]:f) 

1=1 j(a)Cj(s) 



defined for s = 1, . . . , M and for coefficients (note that ^j^^^' ^ = ^) 
as in (23), are uniquely determined (thanks to Corollary 4), and such that 

[#]t-)(xtf) = 0, F-a.s., 

since the X^"'*^) is weakly independent and, therefore, Hoeffding decompos- 
able. Moreover, Corollary 9 yields 

[#]a-i(X?_'?)=0, P-a.s., 
for every 2s — r < N + 1. 
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Remarks, (a) It is interesting that, for any fixed M, the coefficients 9jyj 
appearing in (23) depend on the law of X^'^''^^ only through the quantity c/a{A) , 
that can be interpreted as the (initial) rate of replacement associated to the 
GUS X^"''^). It follows that the Hoeffding-ANOVA decompositons of two 
different finite GUS with the same rate of replacement can be obtained by 
first calculating the (M — l)-ple of functions [7"]^/ j(')) « = 1, • • • , -/Vf — 1, and 
then by implementing exactly the same algorithm. 

(b) The above discussion shows that, not only a statistic T € SHi (X^"'"^^ ) , 

i < M, is uniquely determined by a function (p^^ € Hj(X), but also that such 
function can be "recovered" from T, through (25). 

(c) Note that the recursive relation that defines the coefficients ^ is 
different from that deduced in Zhao and Chen (1990) or Bloznelis and Gotze 
(2001) for the case of A being a finite set with cardinality S > 2M, endowed 
with the counting measure a. However, Corollary 4 ensures that the results 
implied by Theorem 11 and those in the references above are equivalent. 
One can also compare the explicit computations of the parameters 

(2, I'l ('2 2") 

9j^' and 6\_j that appear in Bloznelis and Gotze [(2001), beginning of 
page 901] with those exhibited in Section 6.1. 

Examples and applications of Theorem 11 are given in the next section 
and, to a much wider extent, in Peccati (2002a, b, 2003). Now we establish 
some relations that are used to prove Theorem 11. 

5.2. Auxiliary calculations. Let X^"'"^) = X be a GUS as in the previous 
section (the dependence on a and c is tacitly dropped to simplify the nota- 
tion whenever there is no risk of confusion). The following result is the key 
step of the section: 

Proposition 12. Let the previous notation prevail, andfixm, n, M such 
that l<m<n<M<N, as well as vectors i(^) S VA,/(m) and j(^) G VMin). 

Then, for every symmetric T G L^(Xm), a version o/E[[T]^)„(Xij^j)|Xj(^J 
is given by 

m-'"(i(77i)'j(n)) 

(26) E ^('^."^>^>p)mM7i(Xj,,Ai(„),x,(^)), 

P=0 l(p)Cj(„)\i(„) 
where r = ?'(i(m)5 j(n)) = Card(i(,;„) A j(-„)) and the <&'s are given by (20). 

Proof. By the symmetry of T and of the distribution of the vector 
Xjv/, we can assume without loss of generality that = (1, . . . ,n), A 
i(m) = (l,---,'^(i(m)J(n))) and ir+t > n + 1 for t = l,...,m- r(i(^), j(„)). 
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Note that when > j(n)) = formula (26) is trivial and we shall therefore 
assume that r = G {0, . . . , m — 1}. Now observe that, thanks to 

exchangeability, straightforward calculations yield 

E[m£L(Xi(„))iXj,j 



a{A) + c{n + s — l 
x[T]^MlniXi,...,Xr,yu...,m 



a.s. 



and one can, moreover, rewrite the product measure inside the integral ac- 
cording to the following formula: 



n 

s=l 



s-1 



a=l 



a=l 



n 



=1 



a 



{dys) + J2^xAdys) + Y.^yAdys) 

a=l a=l 



(27) 



+ E \{^x,Sdys 



m—r—l 

+ E E 

p=l r+l<li^---^lp<n 
h(p)C(l,.--,m-r) 



X n 

9=1 



9-1 



+ E^xAdyt,) + Y.^yJdyt. 



a=l 



a=l 



where in the last summand we used the notation 

^(m— r— p) ~ (^1 5 • • • 1 ^gj • • • 1 tm—r—p) 

■■= (l,---,"i-r) \h(p). 

Note that (27) can be easily shown for m, say, equal to 2, whereas the 
general case is proved by a standard recurrence argument. To conclude, use 
once again symmetry and exchangeability to have 
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rriW /-rr'' [a{A) + c{r + s - 1)] 

^ JM,rl^(j(„)Ai(^)); n j^(^) + c(n + S - 1)] 

nr-rK^)+c(n+s-i)], [^jA/,mi^.(„)Aj,„),^i(_)j 

m-f'{i(m)J(n))-l r 

P-^ l(p)Cj(„)\i(m) L 



llTjna{A) + c{n + s-l)] 
which agrees with (26) and (20). □ 

From Proposition 12 we obtain the following: 

Corollary 13. Under the assumptions of Proposition 12, for a fixed 
j(n) e VM{n), a.s.-F, 

m 

E IE[[r]l™L(Xj(„,)|Xj,„,]=^ ^M(<7,n,m)[r]g,(Xj^J, 

j(m)6VAf(m) g=Oj(,)Cj{„) 

where the ^''s are defined as in (21). 

Proof. Straightforward computation, along with Proposition 12, yields 

E nm^MU^kji^sj 

j(,n)eVA/(m) 



^E E 

9=0j(g)Cj(„) 



E l(j(,n)Aj(„)Cj(,))^(">"^,^W-0 
■j(m)GVAf(m) 



where r := r(j(^) A j(„)) = Card(j(m) A j(„)) as before and a simple combi- 
natorial argument gives the desired result. □ 

Another consequence of Proposition 12 is: 

Corollary 14. Under the assumptions and notation of this paragraph, 
let T E L^(Xa./) for some 1 <n < M < N , and such that 

E[r|X:(„_J = 
for every i(n-i) ^ ^M(n - !)• Then, 

7r[r,5i7„](XM)=7ff E [^]Kn(Xj(.)) 

j(n)GVA/(n) 
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with 7^'* defined according to (22). 

Proof. We shall find a constant q such that, a.s.-P, for every G 
VM{n), 

But, thanks to Corollary 13 and the hypotheses in the statement, we can 
explicitly compute 

E n[TtSn(^i,J\^i,J = [r]g„(Xi^„,)*M(n,n,n), 

j(n)GVA/(n) 

thus concluding the proof. □ 

5.3. End of the proof of Theorem 11 . To obtain the coefficients appearing 
in (23), just write for s = 1, . . . ,M — 1 and a given S Vm{s), the a.s. 
condition 

(28) mS,.(Xj^.,)-EE<''^^ E E([r]g,(Xj^^,)|Xj^J = 

6=1 a=l j(a)eVA/{a) 

and observe that exchangeability and symmetry imply that if the ^m's sat- 
isfy (28) for one S Vm{s), then a.s. they satisfy the same condition for 
every element of Va//(s); but the left-hand side of (28) can be rewritten, due 
to Corollary 13, as 

mSs(Xj(j(i-^lr^^M(s,.,.)) 



E E 

9=lj(9)Cj{,) 



b 



EE^M *M(<?,.,a) 

b=q a=q 



that implies (24). The last assertion in the statement of Theorem 11 is just 
plain algebra, and the proof is therefore concluded. 

6. Examples and applications. 

6.1. Examples {maxima and minima). In this section we first consider 
a finite GUS, noted X^'^''^^ for which we calculate the first two terms of the 
Hoeffding-ANOVA decomposition of a T G L^{X.'-^/''^) for M > 2. Then, we 
apply such a result to the case of the simplest order statistics associated to 
a real valued finite GUS. 
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Now consider a finite GUS X^"''^^ satisfying the assumptions of The- 
orem 11: it is easily seen that ^^^'^^ = ia{A) + c)/{a{A) + cM), O^^j"^^ = 
{a{A) + 3c) X {a{A) + 2c)/{a{A) + Mc){a{A) + c(M + 1)) and 



n(2,l) 

hi 



(M - l)(a(A) + 3c){a{A) + c) 
' (a(A) + cM){a{A) + c(M + 1)) '' 

so that, for any symmetric and centered T G L^(X^''^^), 

7r[r,5i7i](XM) 



_ a{A)+c Y^r^.(i) 
"a(A)+cM^^^^J^^'i^^^ 

7r[r,5i72](XM) 

_ (a(A) +3c)(a(A) +2c) 
~ (a(^) + cM)(a(^)+c(M + 1)) . 



j(2)6V^A/(2) 



(2) 



M 



{M-l){a{A)+3c){a{A)+c) ^ 
(a(A)+cM)(a(^) + c(M + l))^^ ^^'^^ ^ 



i=l 



E 



j(2)6VA/(2) 



{a{A)+3c){a{A) + 2c) 
{a{A) + cM)(a(^) + c(M + 1)) 

(a(^) + 3c)(a(^) + c) 
" (a(A) + cM)(a(^) + c(M + 1)) 



rT^i(2) 



E 



(1) /-^{o,c)x 



iCj(2) 



Suppose also that Ac^:we shall compute the three quantities E(r(Xy^"''^^)), 
[-^]m i(^) [^]m 2('^i' -^2) associated to the symmetric statistics T(xj^"''^^) = 
max(xj"''^\ . . . ,X^'^^) (the same calculations hold for the minimum), so to 
write the first two terms of its Hoeffding-ANOVA decomposition. In this 
case, it is easily seen that 



Af-l 



(29) 
where 



E(r(xj^''''*)) = V n{k,c,a{A)) / max{xi, . . . ,XM-k) 

xa®(^^-'=)(dxi,...,dxM-fc), 



n{k,c,a{A)) := c'' 



M-l 

E' 

41=1 



M-l 



■ E n^^ 

2fc=lfe_l+l \s = l 



M 



Yl^a{A) + c{t - 1))-' 



t=i 
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and, again, 

M-2 
k=0 



M-l-k 



M-k-1 



i=0 



M-k-l-i 



X / ma'x{xi, . . . ,Xi, z)a {dxi, . . . ,dxi) 
} Qii^c^aiA)) I maxfxi, . . . , Xj, 2:)a®V(ixi, . . . , drEj), 



1=0 



where n'{k, c, a{A)) := c, a(A)) n|ii(a(^) + c(j - 1))/ n,=l'(«(^) + ci) 
and 

(M-2)A{M-l-i) /M _h-^\ 

C(i,c,a(A))= n'{k,cMA))r' f ^^M-k-i-i 

and, eventually, 

M-3 

= E n"{k,c,a{A)) 

k=0 



i=0 j=0 



■J 



X / ma-x.{xi, . . . ,Xj, zi, Z2)a'^-^ {dxi, . . . ,dxj) 



M-2 „ 

= E C'(j,c,a(A)) / max(xi, . . . ,Xj,zi,Z2)a®-'(d3;i, . . . 

with n"(A:,c,a(^)) := n'(fc, c, a(yl)) nliT'(«(^) + cj)/Y{fj^\a{A) + c{j + 
1)), and 



C'(j,c,«(A))=E E' n"(A.,c,a(yl))(^-^^-')(i)c^™-^- 
and, therefore, by noting Qii{z) = /^i max(xi, . . . . . . , (ixj) and 

Q;,(....) = /^^ma.(.....,.,..„.,)««(<ix.....d.,). 
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we obtain 



a{A) + cM f-^^ 



and, finally, 



{a{A)+3c){a{A)+2c) 
{a{A) + cM){a{A) + c(M + 1)) 



j(2)GVA/(2) 



V k=0 

a{A) + c 
a{A)+2c 



J2c'{k,c,a{A))Ql,{X^^)-E{T) 



fM-l 



Y: { EC(fe,c,a(A))Q?,,(X^^))-IE(T) 



«Cj(2) V k=0 



where E(T) is given in (29). This example shows in particular that, even 
if the coefficients determining the decomposition of a symmetric statistic 
depend exclusively on the rate of replacement associated to the GUS, the 
whole decomposition strongly depends on the form of the associated mea- 
sure a [in this case through the functions Q"{-)]- To conclude, observe that, 
for c = and a{A) = 1, the above calculations reduce to the usual formulae 
for i.i.d. random variables. 



7r[r, 5i7i](xif)) = ^[Q?,^,_i(xf '"V Em] 



M 



(a,0)^ 



i=l 



7r[T,5/72](xlf))= Y: 

j(2)6VA/(2) 



(Q^,M-2(x£f)-IE(T)) 



E (Ql,M-l 

iCj(2) 



E(r)) 



6.2. Weak copies of exchangeable sequences. The content of this section 
is inspired by Follmer, Wu and Yor (2000). Given an infinite exchangeable 
sequence X with values in a Polish space (^4,^), and k>l, we say that 
a random sequence Y = {Yn'.n > 1} is a k-weak copy of X if, for every 

j(fc) £ Vooik), Yj^^j ^^Xj^^j. Plainly, X is a k-weak copy of itself for each k: 
however, one may wonder whether there exist k-weak copies of X for some k, 
whose law differs from that of X. Such a problem can be solved by means 
of the theory developed in this paper: the next proposition shows that the 
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answer is positive for some class of weakly independent sequences (containing 
infinite GUS), and that, moreover, exchangeability is preserved by any k- 
weak copy of X constructed by our techniques. We will note by D{-;u;) the 
directing (probability) measure of the infinite sequence X. 

Proposition 15. Suppose that the infinite exchangeable sequence X is 
Hoeffding decomposable, and that there exists some bounded and symmet- 
ric T on A^^^ such that T S Hfc+i(X) and 



Consider, moreover, the canonical space {A°^ , A®°° ,f) , where P is the law 

of X. Then, there exists a random sequence Y^^^ = {Yjt^ :n > 1}, with ele- 
ments taking values in {A^ A) and with law Qfc(-), that has the following prop- 
erties: 

1. Qfc<IP; 

3. Y^'^) is exchangeable; 

4. Y^'^) is a k-weak copy of^. 

Moreover, for every r] > 0, there exists Qk,r] satisfying points 1 to 4 above 
and such that 



dF 



< r]. 



Proof. Call X = {X„:n > 1} the canonical projection of the space 
{A'^,A^°°) to itself so that, if we endow {A°^,A^°°) with a probability 
measure P, X becomes a random element with law P. Now, it is immediate 
that under the probability measure given by 

1+ / TdD^'^+AdF, 

X satisfies points 1 to 3 in the statement: moreover, for every = (ji, . . . , 
3k)^Voo{k), 



due to the weak independence of X as well as the following relation, that is 
a consequence of de Finetti's theorem and of the fact that T is bounded, 

^&lM'Ur S nXj,.,„) = / T<iD«'«, P-a.s., 
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yielding point 4. The last assertion is an easy consequence of the above dis- 
cussion. 

□ 



To eventually construct such a T for a GUS X of parameters a and 
c > 0, maintain the notation of the proof of the above proposition, set 
jp _ ]p(Q!,c)^ that is, the law of X, and take a bounded and symmetric statistic 
y(Xfc_|„i). Theorem 11 implies that one can choose V such that the func- 
tional tt[V, SHk^i](X.k^i) is not only symmetric and different from zero, 
but also a-s.-P^"'"^^ equal to a finite linear combination of conditional expec- 
tations of V. It follows that for any i] G (0, 1), there exists e > such that 
e|7r[y, SHk+i] \ < r], P*^"''^)-a.s. It is shown in Peccati (2002a), that in this case 

so that it is sufficient to take T = eT:\V, Hk+i\- 

6.3. Covariance analysis. A standard combinatorial argument yields the 
following result that shows how the covariance of two centered and sym- 
metric statistics can be decomposed by means of the functions 0^*) defined 
in (25). 

Proposition 16 (Covariance decomposition). Under the assumptions 
of Theorem 11, let T and Z be two centered elements of (xjy"'*^^ ) , 1 < 
M < N, and let the functions and (ji^^ > s = 1, • • • , M , he defined by (25) . 

E[rZ] = JA/(s,c,a(A))E[05?^(X,)4'^(X,)], 



Then 

M 



s=l 

where 



s J ^o\Pj V s-p A f-J^a{A)+c{s + l-l)' 
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